3574 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English Japanese
Availability:
Freely Available
License:
Creative Commons
Size:
8000 entries Production Status:
Existing-used
Use:
Bilingual Lexicon Extraction
-
Paper title:Bilingual Segmented Topic Model
-
Paper track:Empirical/Data-Driven
-
Paper status:Accept - Poster - Monday
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Akihiro Tamura | National Institute of Information and Communications Technology | JP | ||
| Author 2 | Eiichiro Sumita | National Institute of Information and Communications Technology | JP | National Institute of Information and Communications Technology | N/A |
| Main Contact | Akihiro Tamura | National Institute of Information and Communications Technology | None |
Documentation:
<Not Specified>Language Type:
Multilingual
Languages:
English french
Availability:
Freely Available
License:
<Not Specified>
Size:
11000 <Not Specified>Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Collection of a Large Database of French-English SMT Output Corrections
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Marion Potet | <Not Specified> | None | ||
| Author 2 | Emmanuelle Esperança-Rodier | <Not Specified> | None | ||
| Author 3 | Laurent Besacier | <Not Specified> | None | LIG | None |
| Author 4 | Hervé Blanchon | <Not Specified> | None | ||
| Main Contact | Marion Potet | UJF-LIG | FR |
Documentation:
<Not Specified>
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution-ShareAlike 4.0 International License
Size:
4.4 GByte Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:T-REx: A Large Scale Alignment of Natural Language with Knowledge Base Triples
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Hady Elsahar | Université de lyon - UJM | FR |
| Author 2 | Pavlos Vougiouklis | Electronics and Computer Science, University of Southampton, UK | GB |
| Author 3 | Arslen Remaci | Univ. de Lyon, Laboratoire Hubert Curien | FR |
| Author 4 | Christophe Gravier | Univ. de Lyon Laboratoire Hubert Curien | FR |
| Author 5 | Jonathon Hare | Electronics and Computer Science, University of Southampton, UK | None |
| Author 6 | Frederique Laforest | Univ. de Lyon Laboratoire Hubert Curien | FR |
| Author 7 | Elena Simperl | Electronics and Computer Science, University of Southampton, UK | None |
| Main Contact | Hady Elsahar | Université de lyon - UJM | None |
Documentation:
English documentation available at: https://w3id.org/t-rexLanguage Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
not yet specified, but will be some open source license
Size:
3400 sentences Production Status:
Newly created-in progress
Use:
Evaluation/Validation
-
Paper title:Mapping Texts to Scripts: An Entailment Study
-
Paper track:Evaluation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Simon Ostermann | Saarland University | DE |
| Author 2 | Hannah Seitz | Universität des Saarlandes | DE |
| Author 3 | Stefan Thater | Universität des Saarlandes | DE |
| Author 4 | Manfred Pinkal | Saarland University | DE |
| Main Contact | Simon Ostermann | Saarland University | None |
Documentation:
-
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Attribution ShareAlike 3.0 Unported License
Size:
18439 entries Production Status:
Newly created-finished
Use:
Metaphor Novelty Scoring
-
Paper title:A Corpus of Metaphor Novelty Scores for Syntactically-Related Word Pairs
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Natalie Parde | University of North Texas | US |
| Author 2 | Rodney Nielsen | University of North Texas ; University of Colorado | None |
| Main Contact | Natalie Parde | University of North Texas | None |
Documentation:
http://hilt.cse.unt.edu/resources.html
Multimodal/Multimedia
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons (CC BY-NC-SA 4.0)
Size:
459 sentences Production Status:
Newly created-finished
Use:
Evaluation/Validation
-
Paper title:Evaluating the WordsEye Text-to-Scene System: Imaginative and Realistic Sentences
-
Paper track:Evaluation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Morgan Ulinski | Columbia University | US | ||
| Author 2 | Bob Coyne | Columbia University | US | Columbia University | N/A |
| Author 3 | Julia Hirschberg | Columbia University | US | ||
| Main Contact | Morgan Ulinski | Columbia University | None |
Documentation:
<Not Specified>
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Public Licenses
Size:
13100 sentences Production Status:
Existing-used
Use:
Speech Synthesis
-
Paper title:Attentron: Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length Embedding
-
Paper track:7.4 Speech synthesis paradigms and methods/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Seungju Han | LJSpeech | /N |
Documentation:
https://keithito.com/LJ-Speech-Dataset/
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Creative Commons Public Licenses
Size:
110 speakers, 400 sentences each OtherProduction Status:
Existing-used
Use:
Speech Synthesis
-
Paper title:Attentron: Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length Embedding
-
Paper track:7.4 Speech synthesis paradigms and methods/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Seungju Han | CSTR VCTK Corpus | /N |
Documentation:
https://datashare.is.ed.ac.uk/handle/10283/3443
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
34000 sentences Production Status:
Existing-used
Use:
Speech Recognition/Understanding
-
Paper title:Fusion Architectures for Word-based Audiovisual Speech Recognition
-
Paper track:10.1 Multimodal systems/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Michael Wand | GRID | /N |
Documentation:
Cooke et al., An Audio-Visual Corpus for Speech Perception and Automatic Speech Recognition (2006)
Speech/Written
Corpus,
Language Type:
Bilingual
Languages:
English
Availability:
From Owner
License:
Airbus
Size:
45 hours Production Status:
Existing-used
Use:
Language Modelling
-
Paper title:Automatic Speech Recognition Benchmark for Air-Traffic Communications
-
Paper track:10.6 Innovative products and services based on spe/Oral Presentation
-
Paper status:Accept - Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Juan Pablo Zuluaga | Airbus Challenge | /N |
Documentation:
https://arxiv.org/pdf/1810.12614.pdf




